Exemplar-based Voice Quality Analysis and Control using a High Quality Auditory Morphing Procedure based on STRAIGHT
نویسنده
چکیده
This paper tries to introduce a new strategy and tools for voice quality research that complements conventional approaches. A very high-quality speech analysis, modification and synthesis procedure STRAIGHT, which is basically a channel VOCODER based on a pitch-synchronous analysis synthesis framework, was extended to implement auditory morphing in terms of spectral, pitch and voice quality parameters. This extension enables voice quality modification by parametric transformation using STRAIGHT. It also enables an exemplar-based research strategy for perceptual aspects of voice quality analysis and control. In other words, manipulated synthetic voice having virtually equivalent naturalness to natural voice introduces a mean to perform a unique research strategy called systematic downgrading, that is suitable especially for para and nonlinguistic aspects of human vocalization. In addition to morphing procedure, a set of visualization techniques were introduced based on fixed-point analyses in the time and the frequency domain for assisting exploratory data analysis that is indispensable in voice quality research.
منابع مشابه
Auditory morphing based on an elastic perceptual distance metric in an interference-free time-frequency representation
An elastic spectral distance measure based on a F0 adaptive pitch synchronous spectral estimation and selective elimination of periodicity interferences, that was developed for a high-quality speech modification procedure STRAIGHT [1], is introduced to provide a basis for auditory morphing. The proposed measure is implemented on a low dimensional piecewise bilinear time-frequency mapping betwee...
متن کاملAcappella synthesis demonstrations using RWC music database
A series of demonstrations of synthesized acappella songs based on an auditory morphing using STRAIGHT [5] will be presented. Singing voice data for morphing were extracted from the RWCmusic database of musical instrument sound. Discussions on a new extension of the morphing procedure to deal with vibrato will be introduced based on the statistical analysis of the database and its effect on syn...
متن کاملProcedure "senza vibrato": a key component for morphing singing
A procedure to remove vibrato from singing voice was proposed to enable auditory morphing between musical performances played under different conditions. Analyses of singing samples in the RWC music database using a speech analysis, modification and synthesis system STRAIGHT provided necessary information to implement “senza vibrato,” the procedure that removes vibrato. A preliminary subjective...
متن کاملAuditory Filterbank Improves Voice Morphing
This paper presents a new method for vocal tract length (VTL) estimation and normalization based on a gammachirp auditory filterbank (GCFB) to improve the sound quality in voice morphing. VTL ratios between 28 speakers were estimated based on the spectral distances for all permutations (756 = 28P27) . The VTL estimation using the mel-frequency filterbank (MFFB), which is a preprocessor for calc...
متن کاملStudy on manipulation method of voice quality based on the vocal tract area function
This paper describes a new manipulation method of voice quality which is based on the STRAIGHT analysis-synthesis system. This method manipulates voice quality by changing the vocal tract area function calculated from the PARCOR coefficients. The PARCOR coefficients used in the proposed method is obtained from the auto-correlation function of the STRAIGHT spectrum. We have implemented a simple ...
متن کامل